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O ' Abstract 

o . 

OA . An unsatisfiable formula is called minimal if it becomes satisfiable whenever any of its clauses 

' are removed. We construct minimal unsatisfiable fc-SAT formulas with f2(n fc ) clauses for k > 3, 

thereby negatively answering a question of Rosenfeld. This should be compared to the result of 
\ Lovasz [2] which asserts that a critically 3-chromatic k- uniform hypergraph can have at most {J^^} 

t^j- ■ edges. 

o. 

O ■ 1 Introduction 

Given n boolean variables x\, . . . ,x n , a literal is a variable Xi or its negation Xi (1 < i < n). A clause 
is a disjuction of literals and by k-clause we denote a clause of size k. A CNF( Conjunctive Normal 
Form) formula is a conjunction of clauses and a k-SAT formula is a CNF formula with only A:-clauses. 
Throughout this article formula will mean a CNF formula and it will be given as a pair F = (V, C) 
with variables V = {x\, . . . , x n } and clauses C as collection of disjunction of literals VUV. A formula 
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£S) , is called satisfiable if there exists an assignment of values to variables so that the formula becomes 

true. A formula is called minimal unsatisfiable if it is not satisfiable but removing any clause makes 
it satisfiable. 

Satisfiablity of a formula is closely related to the 2-colorability of a hypergraph in the following 
sense. A formula is satisfiable if there is an assignment of values to variables in a way that no 
clauses have only false literals inside it. Similarily a hypergraph is 2-colorable if there is a way to 
.!_h \ color the vertices into two colors so that none of the edges become monochromatic. A hypergraph 

H = (V, E) is called critically 3-chromatic if it is not 2 colorable but the deletion of any edge makes 
it 2 colorable. In this analogy, minimal unsatisfiable formulas correspond to critically 3-chromatic 
hypergraphs. Therefore it is natural to ask if similar results hold for both problems. In particular, we 
are interested if the same restriction on the number of clauses (edges, respectively) holds or not. 

In the case of lower bounds, roughly the same estimate holds for both formulas and hypergraphs. 
Seymour [3J used linear algebra method to deduce that a critically 3-chromatic hypergraph H = (V, E) 
must satisfy \E\ > \V\ if there is no isolated vertex. The corresponding bound for CNF formulas 
appeared in Aharoni and Linial [3] where they proved that minimal unsatisfiable formula F = (V, C) 
must satisfy |C| > \V\ + 1 if every variable is contained in some clause. 

For uniform hypergraphs there are also known upper bound results. Lovasz [2j proved that any 
critically 3-chromatic /c-uniform hypergraph has at most edges. This result is asymptotically 
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tight, as was shown by Toft [lj who constructed critically 3-chromatic fc-uniform hypergraphs with 
J7(n fc_1 ) edges. 

Motivated by these results, Rosenfeld [5] asked if the analogy also holds for minimal unsatisfiable 
/c-SAT formulas. 

Question. Should minimal unsatisfiable k- SAT formulas have at most 0{n k ~ l ) clauses? 

It is not difficult to show that this conjecture is true for k = 2 and we will give the simple proof 
of this in section [2j However for /c-SAT formulas with k > 3 we show that surprisingly the answer 
for the question of Rosenfeld is negative. In section 3 we will construct minimal unsatisfiable /c-SAT 
formulas with Q(n k ) clauses. 

2 2-SAT formulas 

First we give explicit minimal unsatisfiable 2-SAT formulas. Consider the 2-SAT formula = 
(y( 2 ), C( 2 >) where = {y 1: y 2 ,..., y 2l } and = {y t V y i+1 , y { V y l+1 :i = 1, 2, . . . , 21 - 1} U { Vl V 
V21} U {Vi V V2i}- is unsatisfiable because if g/, = for some i then either y$ V y%+\ or y { Vy j+1 
is false and otherwise if yi = for all 1 < i < 21 — 1 then y\ = y 2l and this time either y\ V y% 
or yl V y 2 i will become false. To prove that F^ 2 > is minimal unsatisfiable, we only check that deleting 
2/1 V y2 or y\ V y 2 \ makes the new formula satisfiable as other clauses can be checked similarily. In 
each case, the assignment of (yi = 2/2 = false, 2/3 = ... = y2i~i = true, 7/4 = . . . = y% = false) and 
(j/i = 2/3 = . . . = J/2Z-1 = false, y 2 = 2/4 = • • • = y2i = true) will make the remaining clauses true. 

Next we prove the linear upper bound of number of clauses in minimal unsatisfiable 2-SAT formulas. 

Proposition 1. Minimal unsatisfiable 2-SAT formulas have at most An clauses. 

Proof. Given a minimal unsatisfiable 2-SAT formula F = (V, C), let's consider the implication graph 
D of this 2-SAT formula which is the directed graph D over the vertices Fuy with two directed edges 
corresponding to each clause z\ V z 2 £ C given as ~2\ — > z 2 and ^2 — > Aspvall, Plass and Tarjan 
[6] proved that 2-SAT is unsatisfiable if and only if its implication graph has a strongly connected 
component which contains both X{ and afj for some index i. Therefore the unsatisfiability of F implies 
the existence of directed path from Xi to Xi and from Xj and Xi in D for some index i. Now observe 
that the minimality of F forces every clause zi V z 2 G C to have at least one of its corresponding 
edge zi — > z 2 or ~z 2 — > zi in these directed paths. As otherwise deleting the clause will not change the 
unsatisfiability of (because it still contains both directed paths). Since D has 2n vertices, there can 
be at most 4n edges in the two directed paths. Therefore |C| < An. □ 

3 /c-SAT formulas 

In this section we construct minimal unsatisfiable /c-SAT formulas on n variables with Q(n k ) clauses. 
For simplicity we describe in details the construction of 3-SAT formulas only. This construction can 
be easily generalized for all k. Informally, start with a minimal unsatisfiable "almost" 3-SAT formula 
with J7(n 3 ) clauses where "almost" means that only a small number of clauses is not of size 3. Then 
transform this formula into a "genuine" 3-SAT formula by replacing the clauses of size greater than 
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3 by 3-clauses while keeping the minimal unsatisfiable property. During the process the number of 
variables will not increase too much and therefore we will end up with a 3-SAT formula that we have 
promised. Now we should make it into a formal argument. 

The following lemma will allow us to change the size of a clause in the formula. This lemma is a 
modified version of Theorem 1 and 4 in [lj which were originally used by Toft to construct fc-uniform 
hypergraphs with ^(n k ~ 1 ) edges. 

Let Fx = (Vx, Cx), Fy = (Vy, Cy) be formulas with disjoint sets of variables (that is, Vxf^Vy = 0) 
and Co = z\ V Z2 V . . . z\. G Cx be a /c-clause of Fx where k < \Cy\. For an arbitrary surjective map h 
from Cy to {zi, Z2, . . . , Z}-}, let the formula Fz = (Vz, Cz) be as following. 

• V z = V x U V Y C z = (C x \{c }) U {c y V h(c y ) : c y G C y } 

Lemma 2. If Fx and Fy are minimal unsatisfiable formulas, then Fz constructed as above is also a 
minimal unsatisfiable formula. 

Proof. Let's first show that Fz is unsatisfiable. For arbitrary values of Vx there must exist a clause 
Cx G Cx which is false. If c x ^ c then we are done as c x G Cz so assume that c x = c. Since every 
literal x G cq is false, a new clause c y V h(c y ) is true if and only if c y is true. But Fy is unsatisfiable 
so there must exist a clause c y which is false and therefore Fz is unsatisfiable. 

Next we prove that removing any clause c z G Cz makes Fz satisfiable. First asuume that c z G 
Cx\{co}. Then give values to Vx so that every clause in Cx except c z is satisfied. Since c z ^ Co, there 
must exist a literal x G Co which is true. Pick a clause d G h~ l (x) C Cy (h (x) is non-empty because 
h is surjective) and give Vy the values which make every clause except d in Cy true. Observe that 
every clause in Cx\{cq} except c z is true by values of Vx and every clause in {c y \/ h(c y ) : c y G C y } is 
true either by values of Vy or the literal x. Now assume that c z = d V x G {c y V h(c y ) : c y G C y } and 
give Vx the values which make every clause except Co true and give Vy the values which makes every 
clause except d true. This assignment of values will make every clause but c z G Cz true and thus we 
are done. □ 

Next step is to construct an "almost" 3-SAT formula with many clauses. Let Vq = {xi,X2, • • ■ , XQ m } 
and look at the formula Fq = (Vb, Co) with clauses given as, 

• Co = {xi 1 V Xi 2 V Xi A : 1 < i\ < 2m, 2m + 1 < %2 < 4m, 4m + 1 < is < 6m} 

U{xi V x 2 V . . . , x 2m } U {x 2m+ i V ... V x 4m } U {x 4m+ i V ... V x 6m } 

Informally, partition the variables V into three equal parts V\ , Vi , V$ and consider every clauses 
x\ V X2 V £3 with Xj G Vi and add three more clauses Vi, V2, V3. Note that this formula contains 
(2m) 3 + 3 clauses. 

Claim 3. Fq is a minimal unsatisfiable formula. 

Proof. Let's first prove that Fq is unsatisfiable. Assume that the three clauses x\ V X2 V . . . V X2m, 
X2m+i V ... V x~4 m , X4 m+ \ V ... V XQ m are all true. Then there must exist 1 < i\ < 2m, 2m + 1 < 12 < 
4m, 4m + 1 < 13 < 6m such that x^ = Xi 2 = Xi 3 = false. But this will make the clause x^ V Xi 2 V Xi 3 
false. Therefore Fq is unsatisfiable. 

Now assume that we remove a clause c. If c = Xj x V Xj 2 V Xi 3 for some £1,^2,^3 then assigning 
2^1 = = x h = false and everything else true will make the remaining part satisfiable. On the other 
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hand if c = x\ V X2 V . . . V x<im then assigning x\ = xi = . . . = x^m = true and everything else false 
will make the remaining part satisfiable. Similar assignment will work for clauses ~x~2m+i V ... V X4 m 
and x 4m+ i V ... V x 6m . □ 

Construction 

Note that the formula Fq is "almost" a 3-SAT formula in the sense that there are only three clauses 
whose size is not 3. Use Lemma [2] with Fx = Fq, cq = x\ V x~2 V . . . V x~2m £ Co and Fy = F^ where 
F^ is a minimal unsatisfiable 2-SAT formula with m variables and 2m clauses as constructed in 
section [21 The obtained formula F\ is a minimal unsatisfiable formula over 6m + m = 7m variables 
and has only two clauses whose size are not 3. (All new clauses are 3-clauses.) 

Repeat the same process with the remaining two 2m-clauses to obtain a minimal unsatisfiable 
formula F% whose every clause has size 3 i.e. F2 is a 3-SAT formula over n = 9m variables. Note that 
it still contains the original 3-clauses {x^ Vxj 2 Vxj 3 : 1 < i\ < 2m, 2m+l <%2< Am,A:m+\ < 13 < 6m}. 
There are 8m 3 = (|^) 3 such clauses and therefore this 3-SAT formula F2 contains 0(n 3 ) clauses. 

For k > 4, minimal unsatisfiable fc-SAT formulas with Q(n k ) clauses can be constructed similarily. 
Use F (fc) = (V (k) ,C ( k) ) where, 

• C^ k) = {x h V x i2 V . . . V Xi J : (t - l)m + 1 < i t < tm, 1 < t < k} 
U( U^ =1 {x (s _i) m+ i V X( s _i) m+2 V ... V x sm }) 

(k) 

By the same process as above one can verify that Fq is minimal unsatisfiable. Then replace the 
m-clauses by /c-clauses using Lemma [2] and minimal unsatisfiable (k — 1)-SAT formulas. The final 
formula will be a minimal unsatisfiable fc-SAT formula with 0(n fc ) clauses. Details are omitted. 
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